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Abstract 

A wide range of constraints can be compactly specified using 
automata or formal languages. In a sequence of recent papers, 
we have shown that an effective means to reason with such 
specifications is to decompose them into primitive constraints 
( |Quimper & Walsh 2006| |Quimper & Walsh 2007| >. We can 
then, for instance, use state of the art SAT solvers and profit 
from their advanced features like fast unit propagation, clause 
learning, and conflict-based search heuristics. This approach 
holds promise for solving combinatorial problems in schedul- 
ing, rostering, and configuration, as well as problems in more 
diverse areas like bioinformatics, software testing and natural 
language processing. In addition, decomposition may be an 
effective method to propagate other global constraints. 



Introduction 

Constraint programming is an expressive and efficient tech- 
nology to solve a wide range of planning, scheduling, 
routing, and configuration problems. However, constraint 
solvers are still some distance from the "model and run" 
capability of solvers for mixed integer programming (MIP) 
and propositional satisfiability (SAT). It requires consider- 
able effort and expertise to model a problem so that it can 
be solved using a constraint solver. A major direction of 
research is therefore developing new ways for the user to 
state their problem constraints that can then be efficiently 
reasoned about automatically. 

Tools from formal language theory (that is, automata 
and grammar rules) are useful to specify many types 
of constraints. Recently, we have shown that decom- 
positions can be used to propagate such specifications 
( |Quimper & Walsh 2006| |Quimper & Walsh 2007) . Our re- 
sults demonstrate that such decompositions are highly com- 
petitive with optimized code, but are easy to implement and 
benefit from the current (and future) advances in SAT solv- 
ing technology. We believe that such methods may be of 
interest to researchers in other areas like natural language 
processing, software testing, and bioinformatics where au- 
tomata and grammar rules are already used for problem 
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specification and where combinatorial problems naturally 
arise. 

Background 

A constraint satisfaction problem (CSP) consists of a set of 
variables, each with a finite domain of values, and a set of 
constraints. A constraint restricts values taken by some sub- 
set of variables to a subset of the Cartesian product of their 
domains. A solution is an assignment of one value to each 
variable satisfying all the constraints. Systematic constraint 
solvers typically construct partial assignments using back- 
tracking search and constraint propagation to prune variable 
assignments which cannot be in any solution. 

We will consider constraints which are specified in terms 
of a grammar or automaton which accepts just valid assign- 
ments for a sequence of variables. Regular languages are 
precisely those accepted by a deterministic finite automaton. 
A deterministic finite automaton (DFA) ft is given by a 5- 
tuple (Q, S, T, qo, F) where Q is a finite set of states, S is an 
alphabet, T : S x Q i— > Q is the transition function, qo G Q 
is the initial state and F C Q is the set of final (or accept- 
ing) states. Context-free languages are above regular lan- 
guages in the Chomsky hierarchy. Context-free languages 
are exactly those accepted by non-deterministic pushdown 
automaton (that is, a automaton with a stack onto which we 
can push and pop values, and a non-deterministic choice of 
transitions). A context-free language can be specified by 
a set of grammar rules in which the left-hand side has just 
one non-terminal, and the right-hand side may have a string 
of terminals and non-terminals. Any context-free grammar 
can be written in Chomsky normal form in which each rule 
yields either just one terminal or two non-terminals. 

An example 

Consider the shift- scheduling benchmark introduced 
in ( |Demassey, Pesant, & Rousseau 2006 1. The schedule of 
an employee in a company is subject to the following rules. 
An employee either works on an activity au has a break (6), 
has lunch (J), or rests (r). When working on an activity, the 
employee works for a minimum of one hour. An employee 
can change activities after a break or a lunch. A break is 
fifteen minute long and a lunch is one hour long. Lunches 
and breaks are scheduled between periods of work. 



Employees can be part-time or full-time. A part-time em- 
ployee works at least three hours but less than six hours a day 
and has one break. A full-time employee works between six 
and eight hours a day and have a break, a lunch, and a break 
in that order. Employees rest at the beginning and the end of 
the day. At some time of the day, the business is closed and 
employees must either rest, break, or have lunch. The day is 
divided into 96 time slots of 15 minutes. During time slot t, 
at least d(t, a*) employees must be assigned to activity <Zj. 

This is a complex problem but we can model it using a 
simple formal language. For each employee, we introduce 
a sequence of 96 variables (one per time slot) whose values 
must spell out a string defined by the following grammar G. 

S -> RPR | RFR 
R — » rR r L — > IL \ I A4 —> a{Ai | a, 
W —> Ai P -> WbW F -> PLP 

S is the unique starting symbol. R represents a period of 
rest. P represents a period of work by a part-time employee. 
F represents a period of work by a full-time employee. W 
represents a period of work on one activity. L represents a 
lunch break. 

We also add restrictions on some of the productions. We 
attach a Boolean function /jv(i,j) to any non-terminal N 
in a production where i represents the time period of the 
start of the non-terminal and j represents the length of 
the non-terminal. For example, with W — > Aj, we have 
fw(i,j) = J > 4 since an employee works on an activ- 
ity for at least one continuous hour. In F — > PLP, we 
have ft(i,j) = {j = 4) since a lunch is one hour long. 
In S -> RPR, we have f P (i,j) = 13 < j < 24 since a 
part-time employee works at least three hours and at most 
six hours plus a fifteen minute break. In S — > RFR, we 
have ./>(&, j) = 30 < j < 38 which represents between 
six and eight hours of work plus an hour and a half of idle 
time for the lunch and the breaks. Finally, the productions 
Ak — * a-kAk I cifc are constrained with f^ h (i, j) = open(i) 
where open(t) returns true if t is within business hours. 
Such restrictions can greatly reduce the size of the gram- 
mar needed but do not increase the asymptotic complexity 
of reasoning about the GRAMMAR constraint. 

When solving the problem with m employees, the model 
consists of 771 sequences subject to a constraint that each 
spells out a string in this language. To ensure sufficient 
workers available for activity a, at time t, we also post the 
constraint that £\ t, a.;) > d(t, etj) where x(j, t, c) is an 

0/1 variable set to 1 iff the t th character of the jth sequence 
is c. As we argue in the next sections, decomposition into 
SAT is an effective method to reason about constraints spec- 
ified in this way. 

Regular constraint 

We start with one of the simplest but nevertheless most 
useful methods so far proposed for specifying constraints 
by means of a formal language. The global constraint 
Regular([ATi, . . . , X n ], fl) ensures that the values taken 
by a sequence of n variables form a string accepted by the 
finite automaton il (Pesa nt2004| ). For example, consider the 



constraint that an employee starts off with a period of rest 
(r), but once working (w) remains so until they again rest. 
We can specify this with the automaton which starts in state 
q r , stays in this state with r but moves to state q w with w. 
From q w , the automaton stays in this state with w but moves 
to state q r i with r. Once in state q r >, only r is accepted. 
This defines the regular language r*w*r* which models this 
constraint on work and rests. 

The Regular constraint can be used to encode a wide 
variety of useful global constraints like the STRETCH con- 
straint dPesant 2 004) (which can specify constraints on the 
length an employee works a continuous stretch of shifts), 
and the PRECEDENCE constraint (which breaks value sym- 
metry) dLaw& Lee 20041 I Walsh 20061 The Regular 
constraint can be decomposed into a simple sequence of 
ternary constraints (Quimper & Walsh 20061. We merely 



need to introduce finite domain variables, Qi which stand 
for the state of the automaton after i symbols, and post 
ternary constraints between variables representing neigh- 
bouring states to ensure appropriate transitions occur. This 
decomposition is a highly efficient and effective means to 
propagate the REGULAR constraint. The decomposition 
does not hinder pruning which takes 0(nd\Q\) time where 
n is the length of the sequence, d is the domain size and \Q\ 
is the number of states of the automaton. This is asymp- 
totically identical to the time complexity of the more com- 
plex monolithic propagator proposed by Pesant based on dy- 
namic programming (Pesant 2004). More recently, Bacchus 
has proposed a SAT decomposition of the REGULAR con- 
straint (Bacchus 2007). This can be seen as the SAT encod- 
ing of the ternary transition constraints in our decomposi- 
tion. 

Another advantage of our decomposition is that we have 
explicit access to the states of the automaton. Consider, for 
example, a rostering problem where workers are allowed to 
work for up to three consecutive shifts and then must take a 
break. This can be specified with a simple REGULAR lan- 
guage constraint. Suppose we want to minimize the number 
of times a worker has to work for three consecutive shifts. 
To model this, we can impose a global cardinality constraint 
on the state variables to count the number of times we visit 
the state representing three consecutive shifts, and minimize 
the value taken by this variable. It is much more complex 
to specify such an optimization constraint when the states of 
the automaton are not represented explicitly in the model. 

Extensions of the Regular constraint 

Whilst deterministic finite automaton can in theory spec- 
ify any type of constraint, such specifications may not 
be compact. We therefore proposed a number of ex- 
tensions including regular languages specified by non- 
deterministic finite automata, and soft and cyclic versions 
of the Regular constraint ( [Quimper & Walsh 2 006). For 
instance, if a problem is over-constrained, we might want 
to insist that we are "near" to a string in the regular 
language, van Hoeve, Pesant and Rousseau have pro- 
posed a generalization of the REGULAR constraint to deal 
with such situations ( [van Hoeve, Pesant, & Rousseau 2 006). 
REGULAR soft ([Xi, . . . ,X n ], N, tt) holds iff the values 



taken by X\ to X n form a string that is at most distance 
N from a string accepted by the DFA given by fi. Distance 
is either Hamming distance (giving the usual variable-based 
costs) or edit distance (which may be more useful in cer- 
tain circumstances). In ( |Quimper & Walsh 2006[ ), we give 
encodings of such soft REGULAR constraints. 

As a second example, we may want to find a repeat- 
ing sequence. We therefore introduced cyclic forms of the 
Regular constraint ( |Quimper & Walsh 2006[ ). In a roster- 
ing problem where the shift pattern is repeated every four 
weeks, such a constraint can be used to ensure that shifts 
changes only according to a set of valid patterns (e.g. a night 
shift is only followed by another night shift or a rest day, and 
is not followed by a day shift, even at the end of the fourth 
week when we repeat back to the first shift). 

Grammar constraint 

Moving above regular languages in the Chomsky hierar- 
chy are context free grammars. The GRAMMAR con- 
straint ( Sellmann 2006 ; Quimper & Walsh 2006 ) permits us 



to specify constraints using any context-free grammar. Al- 
though context-free grammars are more complex to reason 
about than regular languages (e.g. parsing goes from 0(n) 
time for regular languages to 0(n 3 ) time for context-free 
languages), they may compensate by requiring an exponen- 
tially smaller specification. Since parsing (and propagation) 
depends linearly on the size of the grammar, such reductions 
in the size of the grammar can be of considerable benefit. 

Context-free grammar constraints may have applications 
in a number of areas including: 

Rostering and car sequencing: to express constraints that 
are not compactly expressible using a regular language as 
in our earlier example; 

Configuration: to capture the hierarchically structure of a 
product (e.g. the computer consists of a motherboard, and 
input and output devices, the motherboard itself consists 
of a CPU and memory, the CPU is an Intel or an AMD 
processor, etc.); 

Software verification: to represent constraints on the pos- 
sible inputs to a program for fuzz testing; 

Bioinformatics: to express patterns in genes and other 
types of sequences (e.g. context-free grammars are 
needed to represent palindromes); 

Natural language processing: to choose between different 
possible parsings. 

([Quimper & Walsh 2007 



In 



ICote et al. 20071) . 
Grammar constraints have been used to model com- 
plex shift-scheduling problems. To reason about such 
Grammar constraints, we developed two propa- 
gators based on the CYK and Earley chart parsers 
(Quimper & Walsh 2006). Both use dynamic programming. 
Whilst the CYK propagator takes 0(n 3 ) time, the propa- 
gator based on the Earley chart parser is just 0(n 3 ) and 
is not restricted to grammars in Chomsky normal form. 
More promising still, we have proposed a simple AND/OR 
decomposition based on the CYK parser which can be 
encoded into SAT (Quimper & Wals h 2QQ1) . We have 



shown that this decomposition does not hinder propagation 
and is asymptotically as fast as the monolithic propagator 
based on the Earley chart parser. To be more precise, unit 
propagation on this decomposition will prune all possible 
values in the same asymptotic time. Simpler grammars 
can also give a smaller decomposition. For instance, the 
decomposition is just linear on a regular grammar. 

Decomposing global constraints in this way brings sev- 
eral other advantages. First, we can easily add this global 
constraint to any constraint solver. For example, we used 
the decomposition to add the GRAMMAR constraint to both 
a standard constraint toolkit and a state of the art SAT solver. 
Second, decomposition gives an efficient incremental prop- 
agator. The solver can simply wake up just those constraints 
containing variables whose domains have changed, ignor- 
ing those parts of the decomposition that do not need to be 
propagated. Here, for example, we get the first incremental 
propagator for the GRAMMAR constraint, with a worst case 
cost down a whole branch of the search tree that is just the 
same as calling the propagator once. Third, decomposition 
gives a propagator which we can backtrack over efficiently. 
Modern SAT and CSP solvers use watch literals so that we 
can backtrack one level up the search tree in constant time. 
Fourth, decomposition opens up a number of other possi- 
bilities which we are only starting to explore. For exam- 
ple, it may make it easier to construct no-goods, as well as 
cost measures for over-constrained problems. Finally, a de- 
composition may make it easier to construct constraint based 
branching heuristics. 

MIP encodings 

More recently, Cote, Gendron, Quimper and Rousseau 
have proposed mixed-integer programming (MIP) en- 
codings of the Regular and Grammar constraints 
dCote et al. 20071 1. The MIP encoding of the REGULAR con- 
straint introduces linear inequalities to model the flow con- 
structed by unfolding the automaton into a layered transition 
graph. When this is the only constraint in a problem, this can 
be solved with a specialized path finding algorithm. How- 
ever, when there are other constraints in the problem, it can 
be solved with a more general 0/1 MIP solver. The MIP 
encoding of the GRAMMAR constraint introduces linear in- 
equalities which are derived from our AND/OR decompo- 
sition of the CYK propagator. The MIP encoding has one 
significant difference. If there is more than one parsing for a 
sequence, it picks one arbitrarily whilst the CYK propagator 
keeps all. This simplifies the MIP encoding without chang- 
ing the set of solutions since only one parsing is needed to 
show membership in a context-free grammar. Experiments 
on a shift scheduling problem show that such MIP encod- 
ings are highly competitive with other MIP formulations of 
the problem. They open the door to specifying complex shift 
scheduling rules using simple tools from formal language 
theory, and solving these problem with fast MIP solvers. 

Related work 

Vempaty introduced the idea of representing the so- 
lutions of a CSP by a deterministic finite automa- 



ton (Vempaty 1992). Such automaton can be used 
to answer questions about satisfiability, validity and 
equivalence. Amilhastre generalized these ideas to 
non-deterministic automata, and proposed heuristics to 
minimize the size of the automata (Amilhastre 1999). 
This approach was then applied to configuration prob- 
lems (Amilhastre, Fargier, & Marquis 2002). Boigelot and 
Wolper developed decision procedures for arithmetic con- 



straints based on automata (Boigelot & Wolper 2002). 

Carlsson and Beldiceanu derived a propagation al- 
gorithm for a chain of lexicographical ordering con- 
straints based on a deterministic finite automaton 
Carlsson & Beldiceanu 20021 ). For the REGULAR 
constraint, a propagation algorithm based on dynamic 
programming was given in (Pesant 2004). Coincidently 
Beldiceanu, Carlsson and Petit proposed specifying global 
constraints by means of deterministic finite automaton aug- 
mented with counters (Beldi ceanu, Carlsson, & Petit 2 004). 
Propagators for such automaton are constructed automati- 
cally from the specification of the automaton by constructing 
a conjunction of signature and transition constraints. At the 
same time as (Quimper & Walsh 2006 ), Sellmann proposed 
the Grammar constraint and gave a monolithic propagator 
based on the CYK parser ([Sellmann 2006). Quimper and 
Rousseau ( Quimper & Rousseau 2007 ) used automta and 
context-free grammars as an operator for a large neighbour- 
hood local search. Finally, Golden and Pang propose the 
use of string variables which are specificed using regular 
expressions or automata and show how to enforce GAC on 
matching, containment, cardinality and other constraints 



(Golden & Pang 2003). 



Conclusions 

Grammar constraints specify that a sequence of vari- 
ables are restricted to values spelling out a string within 
a given language. Such constraints are useful in a 
wide range of scheduling, rostering and sequencing prob- 
lems. We have shown that decomposition is an effi- 
cient and effective method to reason about such constraints 



(Qui mper & Wa lsh 2006; Quimper & Walsh 2007 ). This an 
easy means to incorporate such grammar constraints into 
constraint toolkits and SAT solvers. We believe that this ap- 
proach holds promise for a wide range of other areas like 
bioinformatics and natural language processing. Another 
promising direction is to learn grammar constraints from ex- 
amples. We can, for instance, leverage on results and algo- 
rithms from grammar induction. For example, due to Gold's 
theorem, it will not be possible to learn a REGULAR con- 
straint from just positive examples. Finally, decomposition 
into SAT may prove effective for propagating other global 
constraints. 
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